Abstract. We present a detailed analysis, based on Markov chain theory, of several versions of
the truel game, in which three players try to eliminate each other in a series of one-to-one
competitions governed by the rules of the game. Besides reproducing some known expressions for
the winning probability of each player, including the equilibrium points, we give expressions for the
actual distribution of winners in a truel competition.

1. INTRODUCTION 

A truel is a game in which three players aim to eliminate each other in a series of one-to- 
one competitions. The mechanics of the game is as follows: at each time step, one of the 
players is chosen and he decides who will be his target. He then aims at this person and 
with a given probability he might achieve the goal of eliminating him from the game 
(this is usually expressed as the players "shooting" and "killing" each other, although 
possible applications of this simple game do not need to be so violent). Whatever the 
result, a new player is chosen amongst the survivors and the process repeats until only 
one of the three players remains. The paradox is that the player with the highest
probability of eliminating his competitors is not necessarily the winner of
this game. This surprising result was already present in the early literature on truels; see
the bibliography in the excellent review of reference [1]. According to this reference,
the first mention of truels was in the compendium of mathematical puzzles by Kinnaird
[2], although the name truel was coined by Shubik [3] in the 1960s.

Different versions of the truel vary in the way the players are chosen (randomly,
in fixed sequence, or simultaneous shooting), in whether players are allowed to "pass", i.e.
to miss the shot on purpose ("shooting into the air"), in the number of tries (or "bullets")
available to each player, etc. The strategy of each player consists in choosing the
appropriate target when it is his turn to shoot. Rational players will use the strategy
that maximizes their own probability of winning, and hence they will choose the strategy
given by the Nash equilibrium point. In a series of seminal papers [4, 5, 6], Kilgour has
analyzed the games and determined the equilibrium points under a variety of conditions.

In this paper, we analyze the games from the point of view of Markov chain theory. 
Besides being able to reproduce some of the results by Kilgour, we obtain the probability 
distribution for the winners of the games. We restrict our study to the case in which there 
is an infinite number of bullets and consider two different versions of the truel: random 
and fixed sequential choosing of the shooting player. These two cases are presented in
sections 2 and 3, respectively. In section 4 we consider a variation of the game in which,
instead of eliminating the competitors from the game, the objective is to convince them
on a topic, making the truel suitable as a model of opinion formation. Some conclusions
and directions for future work are presented in section 5, whereas some of the most
technical parts of our work are left for the final appendices.



2. RANDOM FIRING 

Let us first fix the notation. The three players are labeled A, B and C. We denote by $a$,
$b$ and $c$, respectively, their marksmanship, defined as the probability that a player has
of eliminating from the game the player he has aimed at. The strategy of a player
is the set of probabilities he uses in order to aim at a particular player or to shoot
into the air. Obviously, when only two players remain, the only meaningful strategy
is to shoot at the other player. If three players are still active, we denote by $P_{AB}$,
$P_{AC}$ and $P_{A0}$ the probabilities of player A shooting at player B, at player C, or into the air,
respectively, with equivalent definitions for players B and C. These probabilities satisfy
$P_{AB} + P_{AC} + P_{A0} = 1$. A "pure" strategy for player A corresponds to the case where one
of these three probabilities is equal to 1 and the other two are equal to 0, whereas a
"mixed" strategy takes two or more of these probabilities strictly greater than 0. Finally,
we denote by $\pi(a;b,c)$ the probability that the player with marksmanship $a$ wins the
game when he plays against two players of marksmanship $b$ and $c$. The definition implies
$\pi(a;b,c) = \pi(a;c,b)$ and $\pi(a;b,c) + \pi(b;a,c) + \pi(c;a,b) = 1$.

In the particular case considered in this section, at each time step one of the players
is chosen randomly with equal probability amongst the survivors. There are 7 possible
states of this system, labeled ABC, AB, AC, BC, A, B, C according to the players
who remain in the game. The game can be thought of as a Markov chain with seven
states, three of them being absorbing states. The details of the calculation of the winning
probabilities of A, B and C, as well as a diagram of the allowed transitions between states,
are left for appendix 6.1. We now discuss the results in different cases.

Imagine that the players do not adopt any deliberate strategy and each one shoots
randomly at either of the other two players. Clearly, this is equivalent to setting
$P_{AB} = P_{AC} = P_{BA} = P_{BC} = P_{CA} = P_{CB} = 1/2$. The winning probabilities in this case are:

$$\pi(a;b,c) = \frac{a}{a+b+c}, \qquad \pi(b;a,c) = \frac{b}{a+b+c}, \qquad \pi(c;a,b) = \frac{c}{a+b+c}, \tag{1}$$

a logical result that indicates that the player with the higher marksmanship possesses the
higher probability of winning. An identical result is obtained if the players include shooting
into the air as one of their equally likely possibilities.
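
As a brief worked check of Eq. (1), the chain can be followed by hand. The sketch below assumes the random-firing rules described above (shooter picked uniformly among survivors, all targeting probabilities equal to 1/2); the shorthand $S = a+b+c$ is introduced here only for convenience.

```latex
% From ABC, the next elimination removes C, B or A with probabilities
\Pr(ABC \to AB) = \frac{a+b}{2S}, \qquad
\Pr(ABC \to AC) = \frac{a+c}{2S}, \qquad
\Pr(ABC \to BC) = \frac{b+c}{2S}, \qquad S = a+b+c .
% In a duel where the shooter is picked at random, A beats B with probability a/(a+b), hence
\pi(a;b,c) = \frac{a+b}{2S}\,\frac{a}{a+b} + \frac{a+c}{2S}\,\frac{a}{a+c} = \frac{a}{a+b+c} .
```

The same cancellation occurs when shooting into the air is a third equally likely choice, since it rescales all three exit probabilities from ABC by the same factor.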

FIGURE 1. In the parameter space $(b,c)$ with $c < b < a = 1$, we indicate by black (resp. dark gray,
light gray) the regions in which player A (resp. B, C) has the largest probability of winning the truel in the
case of random selection of the shooting player and the use of the optimal strategy, as given by Eq. 10.

It is conceivable, though, that players will not decide the targets randomly, but will
use some strategy in order to maximize their winning probability. Completely rational
players will choose strategies that are best responses (i.e. strategies that are
utility-maximizing) to the strategies used by the other players. This defines an equilibrium
point when all the players are better off keeping their current strategy than changing to
another one. Accordingly, this equilibrium point can be defined as the set of probabilities
$P_{\alpha\beta}$ (with $\alpha$ = A, B, C and $\beta$ = A, B, C, 0) such that the winning probabilities have a
maximum. This set can be found from the expressions in the appendix, with the result
that the equilibrium point in the case $a > b > c$ is given by $P_{AB} = P_{CA} = P_{BA} = 1$ and
$P_{AC} = P_{A0} = P_{BC} = P_{B0} = P_{CB} = P_{C0} = 0$. This is the "strongest opponent strategy"
in which each player aims at the strongest of his opponents [1]. With this strategy, the
winning probabilities are: 



$$\pi(a;b,c) = \frac{a^2}{(a+c)(a+b+c)}, \qquad \pi(b;a,c) = \frac{b}{a+b+c}, \qquad \pi(c;a,b) = \frac{c(c+2a)}{(a+c)(a+b+c)} \tag{2}$$

(notice that these expressions assume $a > b > c$; other cases can be easily obtained by a
convenient redefinition of $a$, $b$ and $c$).

An analysis of these probabilities leads to the paradoxical result that when all players
use their 'best' strategy, the player with the worst marksmanship can become the player
with the highest winning probability. For example, if $a = 1.0$, $b = 0.8$, $c = 0.5$, the
probabilities of A, B and C winning the game are 0.290, 0.348 and 0.362, respectively,
precisely in inverse order of their marksmanship. The paradox is explained when one
realizes that all players set as primary target either player A or player B, leaving player C as
the last option, and so he might have the largest winning probability. In Fig. 1 we plot the
regions in parameter space $(b,c)$ (after setting $a = 1$) representing the player with the
highest winning probability.
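
The numerical example above can be reproduced with a few lines of code. This is a minimal sketch that simply evaluates the closed forms of Eq. (2); the function name `strongest_opponent_random` is a hypothetical label chosen here, and the formulas are only valid under the stated ordering $a > b > c$.

```python
def strongest_opponent_random(a, b, c):
    """Winning probabilities of A, B, C from Eq. (2).

    Assumes random firing, the strongest opponent strategy and a > b > c > 0.
    """
    total = a + b + c
    pi_a = a ** 2 / ((a + c) * total)
    pi_b = b / total
    pi_c = c * (c + 2 * a) / ((a + c) * total)
    return pi_a, pi_b, pi_c


if __name__ == "__main__":
    probs = strongest_opponent_random(1.0, 0.8, 0.5)
    # Rounded to three decimals these are the values quoted in the text.
    print([round(p, 3) for p in probs])
```

With these inputs the weakest shooter, C, ends up with the largest winning probability, which is the inversion discussed in the text.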

Imagine that we set up a truel competition. Sets of three players are chosen randomly
amongst a population whose marksmanships are uniformly distributed in the interval
$(0,1)$. The distribution of winners is characterized by a probability density function,
$f(x)$, such that $f(x)\,dx$ is the proportion of winners whose marksmanship lies in the
interval $(x, x+dx)$. This distribution is obtained as:

$$f(x) = \int da\,db\,dc\,\left[\pi(a;b,c)\,\delta(x-a) + \pi(b;a,c)\,\delta(x-b) + \pi(c;a,b)\,\delta(x-c)\right] \tag{3}$$

or

$$f(x) = 3\int_0^1 db \int_0^1 dc\; \pi(x;b,c). \tag{4}$$

If players use the random strategy, Eq. (1), the distribution of winners is
$f(x) = 3x\left[x\ln x - 2(1+x)\ln(1+x) + (2+x)\ln(2+x)\right]$. In figure 2 we observe that, as
expected, the function $f(x)$ attains its maximum at $x = 1$, indicating that the players with the
best marksmanship are the ones who win most often.
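
The closed form for the random strategy can be cross-checked numerically against Eq. (4). The sketch below assumes $\pi(x;b,c) = x/(x+b+c)$ from Eq. (1) and uses a plain midpoint rule; the function names and the grid size `n` are illustrative choices, not part of the original analysis.

```python
import math


def f_closed(x):
    """Closed-form distribution of winners for the random strategy, 0 < x <= 1."""
    return 3 * x * (x * math.log(x)
                    - 2 * (1 + x) * math.log(1 + x)
                    + (2 + x) * math.log(2 + x))


def f_numeric(x, n=200):
    """Midpoint-rule estimate of Eq. (4) with pi(x; b, c) = x / (x + b + c)."""
    h = 1.0 / n
    total = 0.0
    for i in range(n):
        b = (i + 0.5) * h
        for j in range(n):
            c = (j + 0.5) * h
            total += x / (x + b + c)
    return 3 * total * h * h


def normalization(n=1000):
    """Midpoint-rule estimate of the integral of f_closed over (0, 1)."""
    h = 1.0 / n
    return sum(f_closed((k + 0.5) * h) for k in range(n)) * h


if __name__ == "__main__":
    for x in (0.25, 0.5, 1.0):
        print(x, f_closed(x), f_numeric(x))
    print("integral of f:", normalization())
```

Since every truel has exactly one winner, $f(x)$ must integrate to one over $(0,1)$, which gives a second, independent consistency check on the formula.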

We now consider a variation of the competition in which the winner of one game
keeps on playing against two other randomly chosen players. The resulting distribution
of players, $\hat f(x)$, can be computed as the steady state solution of the recursion equation:

$$\hat f(x,t+1) = \int da\,db\,dc\,\left[\pi(a;b,c)\,\delta(x-a) + \pi(b;a,c)\,\delta(x-b) + \pi(c;a,b)\,\delta(x-c)\right]\hat f(a,t) \tag{5}$$

or

$$\hat f(x) = \frac{1}{3} f(x)\,\hat f(x) + 2\int_0^1 db \int_0^1 dc\; \pi(x;b,c)\,\hat f(b). \tag{6}$$

In the case of using the probabilities of Eq. (1), the distribution of winners is $\hat f(x) = 2x$ (footnote 1).
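
A short check that $\hat f(x) = 2x$ solves the steady-state condition for $\pi(x;b,c) = x/(x+b+c)$ is sketched below. It assumes the steady state of Eq. (5) is written with the factor $\int db\,dc\,\pi(x;b,c) = f(x)/3$ from Eq. (4) multiplying $\hat f(x)$; all integrals run over $(0,1)$, and the only manipulation is the symmetrization $b \leftrightarrow c$ in the last step.

```latex
\hat f(x)\int db\,dc\;\pi(x;b,c) \;+\; 2\int db\,dc\;\pi(x;b,c)\,\hat f(b)
  = \int db\,dc\;\frac{2x\cdot x + 2x\cdot 2b}{x+b+c}
  = 2x\int db\,dc\;\frac{x+b+c}{x+b+c}
  = 2x .
```

The result is also correctly normalized, since $\int_0^1 2x\,dx = 1$.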

For players adopting the equilibrium point strategy, Eq. (2), the resulting expression
for $f(x)$ is too lengthy to be reproduced here, but the result has been plotted in Fig. 3. Notice
that, despite the paradoxical result mentioned before, the distribution of winners still has
its maximum at $x = 1$, indicating that the players with the best marksmanship are nevertheless the
ones who win most often. In the same figure, we have also plotted the distribution
$\hat f(x)$ of the competition in which the winner of a game keeps on playing. In this case,
the integral relation Eq. (6) has been solved numerically.



3. SEQUENTIAL FIRING 

In this version of the truel there is an established order of firing. The players
shoot in increasing order of their marksmanship, i.e. if $a > b > c$ the first player to
shoot is player C, followed by player B, and the last to shoot is player A. The
sequence repeats until only one player remains. Again, we have left for appendix
6.2 the details of the calculation of the winning probabilities. Our analysis of the
optimal strategies reproduces that obtained in the detailed study of Kilgour [5]. The
result is that there are two equilibrium points, depending on the value of the function
$g(a,b,c) = a^2(1-b)^2(1-c) - b^2c - ab(1-bc)$: if $g(a,b,c) > 0$ the equilibrium point
is the strongest opponent strategy $P_{AB} = P_{BA} = P_{CA} = 1$, while for $g(a,b,c) < 0$ it turns
out that the equilibrium point strategy is $P_{AB} = P_{BA} = P_{C0} = 1$, where the worst player C
is better off shooting into the air and hoping that the second best player B succeeds
in eliminating the best player A from the game.

Footnote 1. The result is more general: if $\pi(a;b,c) = G(a)/[G(a)+G(b)+G(c)]$ for an arbitrary function $G(x)$,
the solution is $\hat f(x) = G(x)/\int G(y)\,dy$.

FIGURE 2. Distribution function $f(x)$ for the winners of truels of randomly chosen triplets (solid line)
in the case of players using random strategies, Eq. (1); distribution $\hat f(x)$ of winners in the case where the
winner of a truel remains in the competition (dashed line).

FIGURE 3. Similar to Fig. 2 in the case of the competition where players use the rational strategy of
the equilibrium point given by Eq. (2).



FIGURE 4. Same as Fig. 1 in the case that players play sequentially in increasing order of their
marksmanship.



The winning probabilities, assuming $a > b > c$, are:

$$\begin{aligned}
\pi(a;b,c) &= \frac{(1-c)(1-b)\,a^2}{[c(1-a)+a]\,[b(1-a)+a]}, \\
\pi(b;a,c) &= \frac{(1-c)\,b^2}{[c(1-b)+b]\,[b(1-a)+a]}, \\
\pi(c;a,b) &= \frac{c\,[bc + a[b(2+b(-1+c)-3c)+c]]}{[c+a(1-c)]\,[b+a(1-b)]\,[c+b(1-c)]},
\end{aligned} \tag{7}$$

when player C shoots into the air, $P_{AB} = P_{BA} = P_{C0} = 1$ (the equilibrium point for $g(a,b,c) < 0$), and

$$\begin{aligned}
\pi(a;b,c) &= \frac{a^2(1-b)(1-c)^2}{[a+(1-a)c]\,[a+b(1-a)+c(1-a)(1-b)]}, \\
\pi(b;a,c) &= \frac{b\,(b(1-c)^2+c)}{[b+(1-b)c]\,[a+b(1-a)+c(1-a)(1-b)]}, \\
\pi(c;a,b) &= \frac{\dfrac{ac(1-b)(1-c)}{a+c(1-a)} + \dfrac{c\,(b+c(1-2b))}{b+c(1-b)}}{a+b(1-a)+c(1-a)(1-b)},
\end{aligned} \tag{8}$$

when all players use the strongest opponent strategy, $P_{AB} = P_{BA} = P_{CA} = 1$ (the equilibrium
point for $g(a,b,c) > 0$). In both cases the three probabilities add up to one. Again, as in the
case of random firing, the paradoxical result appears that the player with the smallest
marksmanship can have the largest probability of winning the
game. In figure 4 we summarize the results, indicating the regions in parameter space
$(b,c)$ (with $a = 1$) where each player has the highest probability of winning. Notice that
the region in which the 'best' player A wins is much smaller than in the case
of random firing.

In figure 5 we plot the distributions of winners $f(x)$ and $\hat f(x)$ in a competition as
defined in the previous section. Notice that now the distribution of winners $f(x)$ has a
maximum at $x \approx 0.57$, indicating that the players with the best marksmanship do not win
in the majority of cases.

FIGURE 5. Same as Fig. 2 in the case that players play sequentially in increasing order of their
marksmanship. Notice that now both distributions of winners present maxima for $x < 1$, indicating that
the a priori best players do not win the game in the majority of the cases.
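
The two candidate equilibria of the sequential game can be compared directly by following the firing order C, B, A round by round. The sketch below is an illustration under stated assumptions: players fire in the order C, B, A with $a > b > c > 0$, a duel between two survivors continues in the same cyclic order, and the helper names (`duel_first`, `strongest_opponent`, `c_passes`) are hypothetical labels introduced here.

```python
def duel_first(x, y):
    """Probability that the player who shoots first (marksmanship x) wins a duel."""
    return x / (x + y - x * y)


def g(a, b, c):
    """Function whose sign selects the equilibrium point in the sequential truel."""
    return a ** 2 * (1 - b) ** 2 * (1 - c) - b ** 2 * c - a * b * (1 - b * c)


def strongest_opponent(a, b, c):
    """C and B aim at A, A aims at B; returns (pi_a, pi_b, pi_c)."""
    cycle = 1 - (1 - a) * (1 - b) * (1 - c)  # someone is hit during a full round
    hit_c = c                      # C eliminates A; B shoots next
    hit_b = (1 - c) * b            # B eliminates A; C shoots next
    hit_a = (1 - c) * (1 - b) * a  # A eliminates B; C shoots next
    pi_a = hit_a * (1 - duel_first(c, a)) / cycle
    pi_b = (hit_c * duel_first(b, c) + hit_b * (1 - duel_first(c, b))) / cycle
    pi_c = (hit_c * (1 - duel_first(b, c)) + hit_b * duel_first(c, b)
            + hit_a * duel_first(c, a)) / cycle
    return pi_a, pi_b, pi_c


def c_passes(a, b, c):
    """C shoots into the air while A and B fight (B first); C then shoots first."""
    b_beats_a = duel_first(b, a)
    pi_a = (1 - b_beats_a) * (1 - duel_first(c, a))
    pi_b = b_beats_a * (1 - duel_first(c, b))
    return pi_a, pi_b, 1 - pi_a - pi_b


if __name__ == "__main__":
    for a, b, c in [(1.0, 0.8, 0.5), (0.9, 0.1, 0.05)]:
        print((a, b, c), "g =", g(a, b, c),
              "C attacks A:", strongest_opponent(a, b, c)[2],
              "C passes:", c_passes(a, b, c)[2])
```

For the two sample points, player C does better by passing when $g < 0$ and by attacking A when $g > 0$, in line with the equilibrium rule stated in the text; the sketch does not attempt to prove this for all parameter values.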



4. CONVINCING OPINION

We reinterpret the truel as a game in which three people holding different opinions, A, B
and C, on a topic aim to convince each other in a series of one-to-one discussions. The
marksmanships $a$ (resp. $b$, $c$) are now interpreted as the probability that a player holding
opinion A (resp. B, C) has of convincing another player to adopt this opinion.
The main difference with the previous sections is that now there are always three
players present in the game, and the different states in the Markov chain are ABC, AAB,
ABB, AAC, ACC, BBC, BCC, AAA, BBB and CCC. The analysis of the transition
probabilities is left for appendix 6.3. We consider only the random case, in which the
person that tries to convince another one is chosen randomly amongst the three players.
The equilibrium point corresponds to the best opponent strategy, the set of probabilities in
which each player tries to convince the opponent with the highest marksmanship. The
probabilities that the final consensus opinion is A, B or C, assuming $a > b > c$, are given by

$$\begin{aligned}
\pi(a;b,c) &= \frac{a^2\left[2cb^2 + a\left((a+b)^2 + 2(a+2b)c\right)\right]}{(a+b)^2(a+c)^2(a+b+c)}, \\
\pi(b;a,c) &= \frac{b^2(b+3c)}{(b+c)^2(a+b+c)}, \\
\pi(c;a,b) &= \frac{c^2\left[c^3 + 3(a+b)c^2 + a(a+8b)c + ab(3a+b)\right]}{(a+c)^2(b+c)^2(a+b+c)},
\end{aligned} \tag{9}$$

respectively. As shown in Fig. 6, there is still a set of parameter values $(a,b,c)$ for which
opinion C has the highest winning probability, although it is smaller than in the versions
considered in the previous sections.

FIGURE 6. Same as Fig. 1 for the convincing opinion model.

FIGURE 7. Same as Fig. 2 for the convincing opinion model.
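
The consensus probabilities can be checked with exact rational arithmetic by a first-step analysis from the state ABC. The sketch below assumes the rules described in this section (speaker chosen uniformly among the three players, each player targeting the opponent with the highest marksmanship, $a > b > c$); the helper `pair_win` encodes the two-opinion result $x(x+2y)/(x+y)^2$ obtained by solving the AAC/ACC-type sub-chain, and all function names are illustrative.

```python
from fractions import Fraction as F


def pair_win(x, y):
    """P(opinion x wins | two holders of x and one holder of y)."""
    return x * (x + 2 * y) / (x + y) ** 2


def consensus_first_step(a, b, c):
    """First-step analysis from ABC: A convinces B, B convinces A, C convinces A."""
    s = a + b + c
    to_aac, to_bbc, to_bcc = a / s, b / s, c / s
    pi_a = to_aac * pair_win(a, c)
    pi_b = to_bbc * pair_win(b, c) + to_bcc * (1 - pair_win(c, b))
    pi_c = (to_aac * (1 - pair_win(a, c)) + to_bbc * (1 - pair_win(b, c))
            + to_bcc * pair_win(c, b))
    return pi_a, pi_b, pi_c


def closed_form(a, b, c):
    """Closed-form consensus probabilities for opinions A, B, C."""
    s = a + b + c
    pi_a = (a ** 2 * (2 * c * b ** 2 + a * ((a + b) ** 2 + 2 * (a + 2 * b) * c))
            / ((a + b) ** 2 * (a + c) ** 2 * s))
    pi_b = b ** 2 * (b + 3 * c) / ((b + c) ** 2 * s)
    pi_c = (c ** 2 * (c ** 3 + 3 * (a + b) * c ** 2 + a * (a + 8 * b) * c
                      + a * b * (3 * a + b))
            / ((a + c) ** 2 * (b + c) ** 2 * s))
    return pi_a, pi_b, pi_c


if __name__ == "__main__":
    a, b, c = F(1), F(4, 5), F(1, 2)
    print(consensus_first_step(a, b, c))
    print(closed_form(a, b, c))
```

Both routes give identical fractions, and the three probabilities sum exactly to one. The expression for opinion A also simplifies to $a^2(a+2c)/[(a+c)^2(a+b+c)]$, because the bracket in its numerator equals $(a+b)^2(a+2c)$.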

Similarly to the other versions, we plot in figure 7 the distribution of winning opinions,
$f(x)$. Notice that, as in the random firing case, it attains its maximum at $x = 1$, showing
that the most convincing players win the game most often. We have also plotted
in the same figure the distribution $\hat f(x)$ which results when one of the winners of a truel
is kept to discuss with two randomly chosen players in the next round.



5. CONCLUSIONS 



As discussed in the review of reference [1], truels are of interest in many areas of
social and biological sciences. In this work, we have presented a detailed analysis of
the truels using the methods of Markov chain theory. We are able to reproduce, in a
language which is more familiar to the Physics community, most of the results of the
alternative analysis by Kilgour [5]. Besides computing the optimal rational strategy, we
have focused on computing the distribution of winners in a truel competition. We have
shown that in the random case, the distribution of winners still has its maximum at the
highest possible marksmanship, $x = 1$, despite the fact that sometimes players with a
lower marksmanship have a higher probability of winning the game. In the sequential
firing case, the paradox is more pronounced, since even the distribution of winners has a
maximum at $x < 1$. It would be interesting to determine mechanisms by which players
could, in an evolutionary scheme, adapt themselves to the optimal values.



6. APPENDIX: CALCULATION OF THE PROBABILITIES 

6.1. Random firing 

In this game there are seven possible states according to the remaining players. These
are labeled 0, 1, ..., 6. There are transitions between those states, as shown in the
diagram in Fig. 8, where $p_{ij}$ denotes the transition probability from state $i$ to state $j$ (the
self-transition probability $p_{ii}$ is denoted by $r_i$).

State   Remaining players
0       ABC
1       AB
2       AC
3       BC
4       A
5       B
6       C

FIGURE 8. Table with the description of all the possible states for the random firing game, and diagram
representing the allowed transitions between the states shown in the table.

From Markov chain theory [7] we can evaluate the probability $u_i^j$ that, starting from
state $i$, we eventually end up in state $j$ after a sufficiently large number of steps. In
particular, if we start from state 0 (with the three players active), the nature of the game
is such that the only non-vanishing probabilities are $u_0^4$, $u_0^5$ and $u_0^6$, corresponding to the
winning of the game by player A, B and C, respectively. The relevant set of equations
is (footnote 2):

$$\begin{aligned}
u_0^4 &= p_{01}u_1^4 + p_{02}u_2^4 + p_{03}u_3^4 + r_0u_0^4, & u_0^5 &= p_{01}u_1^5 + p_{02}u_2^5 + p_{03}u_3^5 + r_0u_0^5, \\
u_1^4 &= p_{14} + r_1u_1^4, & u_1^5 &= p_{15} + r_1u_1^5, \\
u_2^4 &= p_{24} + r_2u_2^4, & u_2^5 &= r_2u_2^5, \\
u_3^4 &= r_3u_3^4, & u_3^5 &= p_{35} + r_3u_3^5.
\end{aligned}$$

Solving for $u_0^4$, $u_0^5$ and $u_0^6$ we obtain:

$$\begin{aligned}
u_0^4 &= \frac{p_{01}\,p_{14}}{(1-r_0)(1-r_1)} + \frac{p_{02}\,p_{24}}{(1-r_0)(1-r_2)}, \\
u_0^5 &= \frac{p_{01}\,p_{15}}{(1-r_0)(1-r_1)} + \frac{p_{03}\,p_{35}}{(1-r_0)(1-r_3)}, \\
u_0^6 &= \frac{p_{02}\,p_{26}}{(1-r_0)(1-r_2)} + \frac{p_{03}\,p_{36}}{(1-r_0)(1-r_3)}.
\end{aligned} \tag{10}$$

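We can now derive the expressions for the transition probabilities $p_{ij}$. Remember that
we denote by $a$ the probability that player A eliminates from the game the player he has
aimed at (and similarly for $b$ and $c$), and by $P_{\alpha\beta}$ ($\alpha$ = A, B, C and $\beta$ = A, B, C, 0) the
probability of player $\alpha$ choosing player $\beta$ (or the air if $\beta = 0$) as a target when it is
his turn to play (a situation that only appears when the three players are still active). We
then have:

$$\begin{aligned}
p_{01} &= \tfrac{1}{3}\left(aP_{AC} + bP_{BC}\right), & p_{02} &= \tfrac{1}{3}\left(aP_{AB} + cP_{CB}\right), & p_{03} &= \tfrac{1}{3}\left(bP_{BA} + cP_{CA}\right), \\
p_{14} &= p_{24} = \tfrac{1}{2}a, & p_{15} &= p_{35} = \tfrac{1}{2}b, & p_{26} &= p_{36} = \tfrac{1}{2}c, \\
r_1 &= 1 - \tfrac{1}{2}(a+b), & r_2 &= 1 - \tfrac{1}{2}(a+c), & r_3 &= 1 - \tfrac{1}{2}(b+c), \\
r_0 &= 1 - \tfrac{1}{3}\left(a(1-P_{A0}) + b(1-P_{B0}) + c(1-P_{C0})\right). &&&&
\end{aligned} \tag{11}$$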
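
The recipe of this appendix, transition probabilities first and absorption probabilities second, can be evaluated for any targeting strategy. The following sketch assumes the state labels 0 = ABC, 1 = AB, 2 = AC, 3 = BC, 4 = A, 5 = B, 6 = C; the function name and the dictionary encoding of the strategy are illustrative choices made here, and exact fractions are used so the results can be compared with Eqs. (1) and (2) without rounding.

```python
from fractions import Fraction as F


def random_firing(a, b, c, P):
    """Winning probabilities (A, B, C) for the random-firing truel.

    P maps (shooter, target) pairs such as ('A', 'B') to targeting
    probabilities; missing pairs count as 0, and whatever is left over
    for a shooter is the probability of shooting into the air.
    At least one player must aim at an opponent, otherwise the game never ends.
    """
    def t(x, y):
        return P.get((x, y), 0)

    # Transitions out of state 0 (ABC): each shooter is picked with prob. 1/3.
    p01 = F(1, 3) * (a * t('A', 'C') + b * t('B', 'C'))  # C eliminated -> AB
    p02 = F(1, 3) * (a * t('A', 'B') + c * t('C', 'B'))  # B eliminated -> AC
    p03 = F(1, 3) * (b * t('B', 'A') + c * t('C', 'A'))  # A eliminated -> BC
    leave0 = p01 + p02 + p03                              # equals 1 - r_0

    # Duels: each of the two survivors is picked with prob. 1/2.
    p14 = p24 = a / 2
    p15 = p35 = b / 2
    p26 = p36 = c / 2
    leave1, leave2, leave3 = (a + b) / 2, (a + c) / 2, (b + c) / 2  # 1 - r_i

    # Absorption probabilities starting from state 0.
    u4 = (p01 * p14 / leave1 + p02 * p24 / leave2) / leave0
    u5 = (p01 * p15 / leave1 + p03 * p35 / leave3) / leave0
    u6 = (p02 * p26 / leave2 + p03 * p36 / leave3) / leave0
    return u4, u5, u6


if __name__ == "__main__":
    a, b, c = F(1), F(4, 5), F(1, 2)
    random_targets = {(x, y): F(1, 2) for x in 'ABC' for y in 'ABC' if x != y}
    strongest = {('A', 'B'): 1, ('B', 'A'): 1, ('C', 'A'): 1}
    print(random_firing(a, b, c, random_targets))
    print(random_firing(a, b, c, strongest))
```

For $a = 1$, $b = 4/5$, $c = 1/2$ the random strategy returns $a/(a+b+c)$ and its analogues, while the strongest opponent strategy returns 20/69, 8/23 and 25/69, which round to the values 0.290, 0.348 and 0.362 quoted in section 2.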



6.2. Sequential firing 

As in the random firing case, we describe this game as a Markov chain, now composed of
12 different states labeled 0 to 11, again with three absorbing states: 9, 10 and 11. In Fig. 9 we can see
the corresponding diagram for this game, together with a table describing all possible
states. Based on this diagram, we can write down the relevant set of equations for the
probabilities $u_i^j$:



[Equations (12) are not recoverable from the source.]

Footnote 2. There is no need to write down the equations for $u_0^6$, since it suffices to notice that $u_0^4 + u_0^5 + u_0^6 = 1$.

FIGURE 9. Table: description of the different states of the game for the case of sequential firing.
The highlighted player is the one chosen for shooting in that state. Diagram: scheme representing all the
allowed transitions between the states shown in the table for the case of a truel with sequential firing in
the order C → B → A with $a > b > c$.



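The general solutions for the probabilities $u_0^{9}$, $u_0^{10}$ and $u_0^{11}$ are given by

$$\begin{aligned}
u_0^{9} &= \frac{1}{1-p_{01}p_{12}p_{20}}\left[\frac{p_{5,9}\,(p_{03}p_{35}+p_{01}p_{15})}{1-p_{35}p_{53}} + \frac{p_{7,9}\,(p_{04}p_{47}+p_{01}p_{12}p_{27})}{1-p_{47}p_{74}}\right], \\
u_0^{10} &= \frac{1}{1-p_{01}p_{12}p_{20}}\left[\frac{p_{3,10}\,(p_{03}+p_{01}p_{15}p_{53})}{1-p_{35}p_{53}} + \frac{p_{01}\,p_{8,10}\,(p_{16}p_{68}+p_{12}p_{28})}{1-p_{68}p_{86}}\right], \\
u_0^{11} &= \frac{1}{1-p_{01}p_{12}p_{20}}\left[\frac{p_{4,11}\,(p_{04}+p_{01}p_{12}p_{27}p_{74})}{1-p_{47}p_{74}} + \frac{p_{01}\,p_{6,11}\,(p_{16}+p_{12}p_{28}p_{86})}{1-p_{68}p_{86}}\right],
\end{aligned}$$

with transition probabilities given by

$$\begin{aligned}
p_{01} &= (1-c) + cP_{C0}, & p_{03} &= cP_{CA}, & p_{04} &= cP_{CB}, \\
p_{12} &= (1-b) + bP_{B0}, & p_{15} &= bP_{BA}, & p_{16} &= bP_{BC}, \\
p_{20} &= (1-a) + aP_{A0}, & p_{27} &= aP_{AB}, & p_{28} &= aP_{AC}, \\
p_{35} &= p_{86} = 1-b, & p_{47} &= p_{68} = 1-a, & p_{53} &= p_{74} = 1-c, \\
p_{3,10} &= p_{8,10} = b, & p_{4,11} &= p_{6,11} = a, & p_{5,9} &= p_{7,9} = c.
\end{aligned}$$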
6.3. Convincing opinion

For this model we show in Fig. 10 the diagram of all the allowed states and transitions,
together with a table describing the possible states.

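The corresponding set of equations describing this convincing opinion model, as
derived from the diagram, is

$$\begin{aligned}
u_0^1 &= r_0u_0^1 + p_{06}u_6^1 + p_{04}u_4^1 + p_{05}u_5^1 + p_{07}u_7^1, \\
u_0^2 &= r_0u_0^2 + p_{04}u_4^2 + p_{05}u_5^2 + p_{08}u_8^2 + p_{09}u_9^2, \\
u_0^3 &= r_0u_0^3 + p_{08}u_8^3 + p_{09}u_9^3 + p_{07}u_7^3 + p_{06}u_6^3,
\end{aligned}$$

$$\begin{aligned}
u_4^1 &= r_4u_4^1 + p_{45}u_5^1 + p_{41}, & u_4^2 &= r_4u_4^2 + p_{45}u_5^2, \\
u_5^1 &= r_5u_5^1 + p_{54}u_4^1, & u_5^2 &= r_5u_5^2 + p_{54}u_4^2 + p_{52}, \\
u_6^1 &= r_6u_6^1 + p_{67}u_7^1 + p_{61}, & u_6^3 &= r_6u_6^3 + p_{67}u_7^3, \\
u_7^1 &= r_7u_7^1 + p_{76}u_6^1, & u_7^3 &= r_7u_7^3 + p_{76}u_6^3 + p_{73}, \\
u_8^2 &= r_8u_8^2 + p_{89}u_9^2 + p_{82}, & u_8^3 &= r_8u_8^3 + p_{89}u_9^3, \\
u_9^2 &= r_9u_9^2 + p_{98}u_8^2, & u_9^3 &= r_9u_9^3 + p_{98}u_8^3 + p_{93}.
\end{aligned} \tag{14}$$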



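The general solution for the probabilities $u_0^1$, $u_0^2$ and $u_0^3$ is

$$\begin{aligned}
u_0^1 &= \frac{1}{1-r_0}\left[\frac{p_{61}\,(p_{06}(1-r_7)+p_{07}p_{76})}{(1-r_6)(1-r_7)-p_{67}p_{76}} + \frac{p_{41}\,(p_{04}(1-r_5)+p_{05}p_{54})}{(1-r_4)(1-r_5)-p_{45}p_{54}}\right], \\
u_0^2 &= \frac{1}{1-r_0}\left[\frac{p_{52}\,(p_{05}(1-r_4)+p_{04}p_{45})}{(1-r_4)(1-r_5)-p_{45}p_{54}} + \frac{p_{82}\,(p_{08}(1-r_9)+p_{09}p_{98})}{(1-r_8)(1-r_9)-p_{89}p_{98}}\right], \\
u_0^3 &= \frac{1}{1-r_0}\left[\frac{p_{73}\,(p_{07}(1-r_6)+p_{06}p_{67})}{(1-r_6)(1-r_7)-p_{67}p_{76}} + \frac{p_{93}\,(p_{09}(1-r_8)+p_{08}p_{89})}{(1-r_8)(1-r_9)-p_{89}p_{98}}\right],
\end{aligned} \tag{15}$$

FIGURE 10. Table: description of the different states of the opinion model. Diagram: scheme
representing the allowed transitions between the states.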



where the transition probabilities are given by

$$\begin{aligned}
p_{04} &= \tfrac{1}{3}aP_{AC}, & p_{05} &= \tfrac{1}{3}bP_{BC}, & p_{06} &= \tfrac{1}{3}aP_{AB}, \\
p_{07} &= \tfrac{1}{3}cP_{CB}, & p_{08} &= \tfrac{1}{3}bP_{BA}, & p_{09} &= \tfrac{1}{3}cP_{CA}, \\
p_{41} &= p_{61} = \tfrac{2}{3}a, & p_{52} &= p_{82} = \tfrac{2}{3}b, & p_{73} &= p_{93} = \tfrac{2}{3}c, \\
p_{54} &= p_{76} = \tfrac{1}{3}a, & p_{45} &= p_{98} = \tfrac{1}{3}b, & p_{67} &= p_{89} = \tfrac{1}{3}c, \\
r_4 &= \tfrac{2}{3}(1-a) + \tfrac{1}{3}(1-b), & r_5 &= \tfrac{1}{3}(1-a) + \tfrac{2}{3}(1-b), & r_6 &= \tfrac{1}{3}(1-c) + \tfrac{2}{3}(1-a), \\
r_7 &= \tfrac{2}{3}(1-c) + \tfrac{1}{3}(1-a), & r_8 &= \tfrac{2}{3}(1-b) + \tfrac{1}{3}(1-c), & r_9 &= \tfrac{1}{3}(1-b) + \tfrac{2}{3}(1-c), \\
r_0 &= \tfrac{1}{3}\left[3 - a - b - c\right]. &&&&
\end{aligned} \tag{16}$$